Linear trees in uniform hypergraphs 
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Abstract 

Given a tree T on v vertices and an integer k > 2 one can define the fc-expansion as a 
fc-uniform linear hypergraph by enlarging each edge with a new, distinct set of k — 2 vertices. T( fc ) 
has v + (v — l)(fc — 2) vertices. The aim of this paper is to show that using the delta-system method 
one can easily determine asymptotically the size of the largest T^-free n- vertex hypergraph, i.e., 
the Turan number of T^ k \ 

1 Definitions: kernel-degree, Turan number 

As usual, a hypergraph H = (V, J 7 ) consists of a set V of vertices and a set T = E(H) of edges, 
where each edge is a subset of V. We call edges of H members of T . If each member of J 7 is a 
^-subset of V, we say that if is a k-uniform hypergraph or J 7 is a k-uniform set system. Thus to 
simplify notation we frequently identify the hypergraph H to its edge set T '. If |V| = n, it is often 
convenient to just let V = [n] = {1, . . . , n}. We also write J- C (, ) to indicate that J 7 is a /c-uniform 
hypergraph, or k-graph for short, on vertex set V. So (Y) denotes the complete /c-graph on vertex 
set V. A set S C V is a transversal (or vertex-cover) of the (hyper)graph H = (V, £) if S R E ^ for 
all E € £. Let r := t(H) denote the minimum number of vertices to cover all edges of H, i.e., the 
transversal number of H. Consider a set of edges of A4 C E(H), it called a matching if it consists of 
disjoint members of E(H). v{H) denotes the matching number of H, i.e., the maximum number of 
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pairwise disjoint edges of if. A family of sets F\, . . . ,F S are said to form a A.- system of size s with 
kernel C if Fj n Fj = C for all 1 < i < j < s. 

Given a family T C (^) and a subset W C [n], we define the degree of W in T as 

degjr(W) = |{F:FG J,^CF}|. 

The hypergraph {F : F E J 7 , W C F} is denoted by So deg^W) = |F[iy]| and deg^(0) = 

|F|. 

We define the kernel degree of W, denoted by deg^r(W), as 

deg^(W) = max{s : 3 a A-system of size with kernel W in J 7 }. 

In other words, degjr(W) is the matching number of {F \ W : W C F G £"}. 

Given a family % = {Hi, H<i, . . . } of hypergraphs, the k-uniform hypergraph Turdn number of 7~L, 
denoted by ex(n,7~L), is the maximum number of edges in a /c-uniform hypergraph T on n vertices 
that does not contain a member of T-L as a subhypergraph. If we want to emphasize k, then we 
write exfc(n, H). An Ti-fiee family T C is called extremal if |F| = ex(n,7i). If 7^ consists of a 
single hypergraph H, we write ex(n,H) for ex(n, {F}). Surveys on Turan problems of graphs and 
hypergraphs can be found in [14J and [20] , 

It is easy to show (see, e.g., Bollobas j2], p. xvii, formula (0.5)) that any graph G = (V,£) with 
more than (5 — f)\V\ edges contains an induced subgraph G' with minimum degree at least 5. Then 
G' contains every tree of 8 + 1 vertices. We have 

ex(n, T) < (v - 2)n, (1) 

where T is any v- vertex forest, v > 2. 

We have for integers 6>a>0, b > t > 1 that 

/o\ _ a /a - 1\ a /6 - 1\ _ a /6\ 
VV ~ l\t-l) -t\t-l) ~b\t)' 

This implies the following lemma. 
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Lemma 1.1 Suppose that z\ > Z2 > • • • > z m and t are non-negative integers, z\ > t > 1. Then 

l<i<m v 7 1 x ' 

2 Preliminaries: matchings, paths, stars 

The Erdos-Ko-Rado theorem that says that for all n > 2k the maximum size of a /c-uniform family 
on n vertices in which every two members intersect is > with equality achieved by taking all the 
subsets of [n] containing a fixed element. If we let M„ denote the /c-uniform hypergraph consisting 
of v disjoint fc-sets, then the Erdos-Ko-Rado theorem says exj.(n, ) = (tZi) f° r an n — More 
generally, Erdds [5] showed for any positive integers k, v there exists a number n(/c, v) such that the 
following holds. For all n > n(k, v), if T C contains no z/ + 1 pairwise disjoint members then 

/n\ (n — V s . 
\A< J" ,„ ■ (3) 



Furthermore, the only extremal family T consists of all the k-sets of [n] meeting some fixed set S of 
i/ elements of [n]. 

Note that the value of n(2, r>) was determined by Erdos and Gallai [7J, and more recently n(3, v) 
was determined by Luczak and Mieczkowska [21] (for v > vq ~ 10 5 ). We only use the following 
upper bound due to Frankl |10| which holds for every n and every family T C 

W<W"Y (4) 



Jfe-1 

Summarizing, for fixed fc and v as n — > oo we have that 

ex fc (n,M( fc )) = (z, + o(l))^"j). (5) 

A linear path of length £ is a family of sets {F\, . . . ,Fi} such that \Fi n = 1 for each i 

and i^j nFj = whenever \i — j\ > 1. Let 7-^ denote the fc-uniform linear path of length t. It is 
unique up to isomorphisms. Note that this notation is different from what is usually used, where 
P v denotes a u-vertex path. Concerning the graph case (k = 2) Erdos and Gallai [7] proved that 
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ex2(n, ) < \(l — l)n. Here equality holds if G is the disjoint union of complete graphs on I 
vertices. The value of ex2(n,7-^ ) was determined for all n by Woodall [3D] and Kopylov |23j. 

Concerning linear paths of two edges Erdos and Sos [6] proved for triple systems (k = 3) that 
ex3(n, V<2 ) = n or n — 1 (according to n is divisible by 4 or not and n > 4). They conjectured that 

ex fc (n,pf)=^" 2 2 ) (6) 

for fc > 4 and sufficiently large n with respect to A;, and this was proved by Frankl [9j. The case 
k = 4 was finished for all n by Keevash, Mubayi, and Wilson [22J. 
The case £ < k was asymptotically determined in |12j . 

Since the paper by G. Y. Katona and Kierstead pjj] (1999) there is a renewed interest concerning 
paths and (Hamilton) cycles in uniform hypergraphs. Most of these are Dirac type results (large 
minimum degree implies the existence of the desired substructure) like in Kiihn and Osthus |21j . 
Rodl, Rucihski, and Szemeredi [2"9"j . 

The present author, Tao Jiang, and Robert Seiver |17| determined exfc(n, Vp^) exactly, for all 
fixed k,£, where k > 4, and sufficiently large n proving 

,(fe) n f n — 1\ f n — 2\ / n — t 



where the only extremal family consists of all the fc-sets in [n] that meet some fixed set S of t 
elements, and 



^o=(i: 1 1 )HrD + - + (M) + C , ^ 2 i' (8) 

where the only extremal family consists of all the A;-sets in [n] that meet some fixed set S of t elements 
plus all the fc-sets in [n] \ S that contain some two fixed elements. 'Sufficiently large' n means that 
([7]) and ([8]) hold when kt = O(loglogn). It is conjectured that they hold for all (or at least almost 
all) n's. The method in |17| does not quite work for the k = 3 case (cf. the remark after Lemma [6.2 
below) but it is conjectured that still a similar result holds for k = 3. 

A {linear) star of size I with center x is a family of sets {iq, . . . , Fp} such that x £ iq for all i but 
the sets F{ \ {x} are pairwise disjoint. Let <sl^ denote the fc-uniform star of size £. It is obvious that 
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ex2(5^ ,n) = [(£ — l)n/2\ (for n > £). Chung and Frankl [3J gave an exact formula for ex3(<S^ , re) 
for n > 3£ 5 . The following asymptotic was proved for any fixed £ > 2, k > 5 in |12| . 

eMn,sf ) ) = {^{t) + o{l))^ k ~_ 2 ^, (9) 

where ip(£) = £ 2 — i for £ is odd and it is £ 2 — |^ when £ is even. 

According to the above mentioned result of Chung and Frankl ([9]) holds for k = 3 too. The order 
of magnitude ex4 (n,S l f ) ) = VL(£ 2 n 2 ) was also proven in [12] , and it is conjectured that @ holds for 
k = 4 too. 



3 Generalized /c-forests, an upper bound 

Let us define a generalized k-forest in the following inductive way. Every fc-graph consisting of a 
single edge is a /c-forest. Suppose that T = {E\, E2, ■ ■ ■ , E u } C (Y) is a /c-forest and suppose that 
j4 := A u+ i C £7j for some I <i <u, and i? n V = 0, [A| + \B\ = k, then E2, . . . , -E u > is a 

/j-forest with := A U -B. If it is connected then it is called a generalized fc-tree. In that case all 

defining sets A2, ■ ■ ■ ,A u+ \ are nonempty. Note that for graphs {k = 2) the above process leads to 
the usual notions of forests and trees. If each defining set A% is a singleton or empty then we obtain 
a linear forest, if each defining set is either empty or has k — 1 elements, then we get a tight forest. 
A forest T of q edges has at least q + k — 1 vertices and here equality holds if and only if T is a tight 
k-tree. 

Theorem 3.1 Suppose that T is a generalized k-forest of p vertices and q edges. Then 

ex fc (n,T)<(p-l)2^ 1 (^ i ). (10) 

Proof. Suppose that % C (^) avoiding the /c-forest T = {E\, . . . ,E q } and set A u+ \ := E u+ \ n 
(Ei U • • • U E u ), 1 < u < q — 1. Define a list of hypergraphs Ho := H D Hi D ■ ■ ■ D 7i m and sets 
Xi, . . . , X m , simultaneously as follows. 

If rim = we stop. If one can find a set X C [re] such that \X\ < k and deg^ m (X) < (p — 1) 
then let X m+1 := X and rl m+ i := rl\ rl m [X\. If there is no such set X then we stop. 
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We claim that H m should be the empty family. Otherwise, we can embed T into H m as follows. 
Start with any edge E\ E Tim- We define the other edges E 2 , ■ ■ ■ ,E q one by one. Observe that for 
any subset X C E E H m we have deg^ m (X) > p. Suppose that E\, . . . , E u had already been defined 
together with A 2 , ■ ■ ■ , A u . Locate A u+ \ in E\ U • • • U E u . Since deg^ m {A u+ i) > p > \E\ U • • ■ U E u \ 
there is an E := E u+ \ € such that E \ A u+ \ is disjoint to E\ U • • • U E u . 

Note that in the sequence X\, . . . , X m there is no repetition, so using (jl]) we get 

i i 1*1 / X:\X\<k V 11/ 

Note that Theorem 13.11 gives the correct order of magnitude if PlT = 0, since then (?Zi) i s a 
lower bound. However, the determination of the best coefficient of the binomial term seems to be 
extremely difficult. Erdos and Sos conjectured for graphs (i.e., k = 2) and Kalai 1984 for all k, see 
in [12], that for a v- vertex tight tree T 

v — k ( n 

ex fc (n,7 ) < 



k \k-l 

For any given tight tree T a matching lower bound, i.e., (1 — o(l)) times the conjectured upper 
bound, can be given for n — > oo as follows. Consider a P(n, v — 1, k — 1) packing Pi, . . . , P m on the 
vertex set [n] (i.e., |Pj| = v — 1 and |Pj fl P,-| < /c — 1 for 1 < i < _/ < m) and replace each Pj by 
a complete /c-graph. We obtain a T-free hypergraph. Then Rodl's |28j theorem on almost optimal 
packings gives 

e Xl («,T) > (1 - o,l))g x («- ') = (1 + o,,),^ » x ). 

The Erd6s-S6s conjecture has been recently proved by a monumental work of Ajtai, Komlos, 
Simonovits, and Szemeredi [I], for u > uo- 

The Kalai conjecture has been proved for star-shaped fc-trees in [12], i.e., whenever 7~ contains a 
central edge which intersects all other edges in k — 1 vertices. For k = 2 these are the diameter 3 
trees, 'double stars'. 
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There is only one more class of fc-trees where exact asymptotic is know, namely what is called 
an intersection condensed family. For such an T we denote | n7~| by p^, and the number of vertices 
of degree at least two by P2 and suppose that 2poo + P2 + 2 < k (Theorem 5.3 in |12j). 

There are many different definitions of a 'path' in a hypergraph. Gyori, G. Y. Katona, and 
Lemons [18] determined the exact value of the Turan number of the so-called i?erge-paths for infinitely 
many n's. Mubayi and Verstraete |26j gave good bounds for the Turan number of /c-uniform loose 
paths of length I. 

The aim of this paper is to present the best coefficient for a wide class of linear trees, thus 
generalizing the results in the previous section about matchings (0), paths ©-(IE]) and stars Q. 

4 The main result, finding expanded forests in &;-graphs 

Given a graph H, the k-blowup (or /c-expansion) , denoted by [ii~]( fc ) (or H( k > for short), is the k- 
uniform hypergraph obtained from H by replacing each edge xy in H with a fc-set E xy that consists of 
x, y and k — 2 new vertices such that for distinct edges xy, x'y', (E xy — {x, y}) D {E x / y i — {x', y'}) = 0. 
If H has p vertices and q edges, then has p + q(k — 2) vertices and q hyperedges. The resulting 
is a fc-uniform hypergraph whose vertex set contains the vertex set of H. 
Given a forest T define the following 

cr(T) := min{|X| + e(T \ X) : X C V(T) is independent in T}. (11) 

Here T \ X is the forest left from T after deleting the vertices of X and the edges incident to them, 
e(G) stands for the number of edges of the graph G. Since the edges avoiding X can be covered 
one by one we have that t(T) < o~(T) but here equality should not hold. For example, if T consists 
of a path of four vertices ai 6162^2 with 2d + 2c pendant edges such that d > c > 1 and each 
has d degree-one neighbors and each 6j has c of those, then we can easily see that r(T) = 4 but 
a(T) = 2c + 3. 

Theorem 4.1 Given a forest T with at least one edge and an integer k > 4. Then we have as 
n — > 00, that 

ex(n,r( fe )) = (a(T)- 1 + (1))^Y (12) 
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Our result, naturally, gives the same asymptotic as Theorem 5.3 in |12] whenever both can be 
applied to T^ k \ We conjecture that (fTT]) holds for k = 3, too. 

According to ([!]) (and the remark after that) the above asymptotic holds for stars. For every 
other forest a > r > 2. From now on, we usually suppose that r(T) > 2. 

Let us note that Mubayi [25J and Pikhurko [27] determined precisely (for large n) the Turan 
number of the /c-expansion of some other graphs, namely for the complete graph Kg for I > k > 3. 
For smaller values of I we know that ex^iTg^, n) = C£~i) f° r n > n o(k), k > 5, a former conjecture 
of Chvatal and Erdos established in [12]. 

5 The product construction 

Given two set systems (or hypergraphs) A and B their join is the family {^4 L) B : A £ A, B £ B}. 
We denote this new hypergraph by A n B. 

Call a set Y 1-cross-cut of a family C if \Y D E\ = 1 holds for all E € C. Define T\{C) as the 
minimum size of a 1-cross-cut of C (if such cross-cut exists, otherwise n := oo). One can see that 
every forest T and A; > 3 the following holds. 

a(T) = T 1 (T^). (13) 

Thus o~(T) is the minimum size of a set y such that can be embedded into (^) x (^i) where 
Y and Z are disjoint sets, \Z\ > /eg. This means that in case of Y := [a — 1], Z := [n] \ Y the 
hypergraph (^) x J does not contain any copy of T^ k \ We obtain the lower bound 

ex(n, T( fc )) > | Q h^) | = |{E : £ € ( [ ^ ] ) , |E n [a - 1] | = 1}| = (a- 1) ( n " °_ + X ) . (14) 

6 The graph of 2-kernels, starting the proof with the delta-system 
method 

Given a family J 7 C the kernel-graph with threshold s is a graph G := G2, S {J~) on [n] such 

that Vx,y S [n], € E{G) if and only if degjr({:r, y}) > s. The following (easy) lemma shows the 
importance of this definition. 
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Lemma 6.1 (see |17j ) Let H be a graph with q edges, s = kq, and let T C (^). Let G2 be the 
kernel graph of T with threshold s. If H C G2, then T contains a copy of H^ k \ □ 

The delta-system method, started by Deza, Erdds and Frankl [4J and others, is a powerful tool 
for solving set system problems. Using a structural lemma from |13j and the method developed 
in |11[ [12] the following theorem was obtained in [T7J (see Theorem 3.8 and the proof of Lemma 4.3 
there). 

Lemma 6.2 (see [17] ) Let T C (^), T a forest of v vertices, s = kv, G2 := G2(F), and suppose 
that J- does not contain T^ k \ Then there is a constant c := c(k,v) and a partition T = T\ U with 
the following properties. 

- \^\<4T-D- 

Every edge F € T% has a center (not necessarily unique) x(F) £ F such that G2I-F contains a 
star of size k — 1 with center x{F). In other words, {x(F),y} £ E{G-z) for all y € F \ {x(F)}. □ 

Actually, the delta-system method describes the intersection structure of J 7 in a more detailed 
way, but for our purpose this lemma will be sufficient. The above lemma (and in fact the main result 
of this paper, Theorem 13. 1 j> preceded ([7])-([8~]), see [15], but since the proof of Lemma [6.21 are now 
available in [T7] we omit the details here. 

Note that this is the only point where k > 4 is used. Lemma 16.21 is not true for k = 3. The 
3-graph J 73 obtained by joining a matching of size t and t one-element sets has n = 3t vertices, 
t 2 = n 2 /9 = il(n fc_1 ) edges, it does not contain any linear tree except stars but G2, S (F 3 ) forms a 
matching for every s > 2. 

7 Proof of the Main Theorem 

Suppose that T C avoids the fc-expansion of the ^-vertex forest T, k > 4. We are going to give 
an upper bound for As noted above we may suppose that T is not a star, <r(T) > r(T) > 2. 

Define s = vk and let G2 be the kernel graph with threshold s with respect to family T as defined 
in the previous Section. This graph avoids T by Lemma 16. 11 so ([1]) implies 



e(G 2 ) <(v- 2)n. 



(15) 
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Consider the degree sequence of G2 and suppose that 
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deg(xi) > deg(x 2 ) > ••• > deg(x n _i) > deg(x n ). 

Let L := {x±, . . . , xi} be the set of highest degrees. We will define I later as n £ so keep in mind that 
it is relatively large. Using ([15]) we obtain 

, / v ^ deg(xi) + • • • + deg(s m ) 2e(G) 2(v - 2)n f . 

z := de gG2 (x i+1 ) < _ < y-j-y < (16) 

Consider the partition JF = T\ U F% given by Lemma 16.21 Let F3 be the edges of T2 with center 
outside L. Using Lemma 16.21 and ([2]) then (I15D . a triviality and (I16D we get 



^ v A;-i 7 ~ 2 u - 1 

Ki<n v 7 



2(t;-2)n / z \ 2(u - 2)n fc _ 2 2^ - 2)^ n^ 1 

z Vfc-v {k-l)l ~ (fc-1)! ^ fc - 2 ' 1 j 

Every edge of J 7 \ (J 7 ! U J-3) meets L. Let J-4 be the set of members of J- meeting L in at least 
two vertices. Obviously 

1 1 ~ V 2 / \ k - 2/ 2 x (jfe - 2)! v ; 

The edges of T \ [T\ U J3U J-4) meet L in exactly one element. Let J-5 be the family of edges of 
J 7 satisfying \F f] L\ = 1 and degjr(F \ L) < a — 1. Obviously, 

wi £(-!)(:: 0- pa) 

The rest of the edges, i.e., those from .Fg := J 7 \ (J 7 ! UJ3UJ4U J-5) are of the form F = {a} U -B 
where a£L, BC\L = $ and degj-(-F \ L) > <r. For every set A 6 ( ) define as the /c — 1 uniform 
family 

B A := {B : {a} U B G J" for all a G A}. 

Also set 

T A := {F G T : a G A, B G B A , and {a} U B = F}. 
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We have C Ua^a where \A\ = a, A C L. 

Consider T^ k \ As noted in Section [5j there is a 1-cross-cut, a set Y of size a meeting each k-edge 
of in a singleton. Let C be the (k — l)-uniform hypergraph obtained by deleting the elements 
of Y from the edges of T^ k \ C := {E\Y : E £ E(T^)}. Since F A does not contain we have 
that Ba can not contain C as a subhypergraph. Also, C is a generalized forest of at most v edges so 
Theorem Ogives \B A \ < (v - l)(k - l)2 fc - 2 ( fc ™ 2 ) . We obtain 



fc-2 



n 

fc-2 



!^6l< £ 1^1 = ^ £ |Ba| < crC^ (w - l)(Ar - 1)2 

Ml) Ml) w 

Finally, since J2 C J3LU4UJ5U 7^ we have 

1^1 < l^il + l^sl + l-^i! + |-F 5 | + l^el- 
Using the first part of Lemma ES 03), (USD, (USD and ([H we obtain 

,n k 1 , „ , „ 1* — o-. , ^/n — £\ ;._ >, 



(20) 



[jr| < ( n ^) + 0(-^) + 0(W) + (a - 1) I _ H + O^/^). (21) 

Defining t ~ n 1 /^ 1 ) we obtain that the sum of the OQ terms in (J2TJ) is 0(n^ k ~ 1 ^~ l ^ tr+1 ^) = o(n fc_1 ) 
and we are done. 



8 Further problems 

With a refined version of the above proof one can see that 



ex(n, T«) = {a - 1) \ + 0(n k - 2 ). 



It seems to be a solvable problem to determine the exact value of this Turan number (for n > no(T, k)) 
as it was done in [17] for paths. 
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